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INTRODUCTION 


In classical control and decision-making problems, the system is handled in a centralized 
fashion, namely, there is a single supervisor who handles all the information processing and decision 
making for the entire system. The decisions of control policies and their implementation are all 
made according to the preference of this central supervisor. 

By contrast, when a large-scale system is considered, such as those arising from the studies of 
socio-economic problems and electrical power systems, information processing and control policy 
decision are delegated to a set of agents. Generally, these agents have different information, differ- 
ent permissible control notices, and different preference orderings. These agents may act in com- 
plete independence; some may coordinate or supervise the actions of others and hence they form a 
certain hierarchical decision structure in the system. The behavior of the entire system will result 
from the interaction of all the decisions by these agents. Such an environment is called here decen- 
tralization decision making. 

This paper deals with a special class of decentralized control problem in which the objectives 
of the agents are to steer the state of the system to certain desirable levels. Such a problem is 
called the decentralized regulation problem. Each agent is concerned about certain aspects of the 
state of the entire system. The following defines notation: 
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The state variables are affected by the individual controls by the following differential equations: . 

X 2 = f 2 (X,U) 

(1 

X n =f n (X,U) . 


The objective of agent i is to use control to affect the system so that state Xj can approach a cer- 
tain desirable level as time /-*■<». Without losing generality, we can assume that such desirable 
levels are zero for all agents. 
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The information available to agent i at any time is assumed to be a mapping of the present 
state x to his data space, that is, for the ith agent information: 


y i = h i (x), h i *.Ri^R Pi . (2) 

Control strategy is assumed to be of a feedback form from information y f - to that is, 

n • ip - 

Control: u^~ 7 / -[/i / -(x)] , 7 j :R 1 -*■ R [ (3) 

The mapping 7 : above is chosen from the permissible set of functions T z -. Two sets of ques- 
tions arise from the above function: 

( 1 ) Stabilization: Given the structure (/j- , hj, r z ), is it feasible to find | 7 ; J such that x z - -*■ 0 as 
t -+ 00 for all /? If it is not feasible, what kind of structure modification will enable us to make it 
feasible? 


(2) Optimization: When it is feasible to regulate all the states, x z 0 as t -*■ what will be 
the optimal { 7 ^ to achieve such goals with respect to certain performance criteria? What is the 
impact of various information structures to the performance of regulation? 

To be more specific, we shall limit ourselves to linear time-invariant systems: 


where 


n 

x = Ax + B i u i 
i=l 


x T = {x x T , . . . ,x n r ) 


y i = H i x 


(iy 


( 2 )' 


and 

u i = F Pi 


(3)' 


where is a constant real matrix of appropriate dimension. 


STABILITY AND COORDINATION 


Agent i applies control « z = FjZj to regulate the state Xj so that its desirable level Xj = 0 will be 
achieved and maintained. Because the actions by all agents are coupled, the ability of agent / to 
regulate state x z depends on the actions of the other agent. On the other hand, a control u { of 
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agent i supposedly to decrease the deviation of Xj from zero may affect the other system state- 
variables Xj. This paper discusses the interactions of these individual regulation actions. 


Individual Stability 

System state variable Xj is said to be individually stable with feedback control gain Fj if agent i 
applies the control Uj =FjXj and all other agents take no actions (Fj = 0 Vi ¥^i) and t -*■ °° implies 
Xj -* 0. The collection of all such Fj is denoted by Sj. 


Collective Stability 

System state variable x is said to be collectively stable with feedback control gains (Fj, . . . , F n ) 
if, for all i , agent / applies control Uj = Fjyj and t implies Xj -*■ 0 for all i. The collection of all 
such (FjX . . . , xF n ) is denoted by S c . 

The following two observations can be made: 

(1 ) Fj E Sj V i does not imply (FjX . . . xF n ) G S c 

(2) Fj Sj V i does not imply (FjX . . . xF n ) ^ S c . 

These two facts are easily demonstrated by the following scalar example: 

Xj —~ Xj + ax 2 +Wj + u 2 
x 2 — ~x 2 Fax l +Ui + u 2 - 

Both and u 2 influence x, and x 2 . Agent 1 has access only to information x, and agent 2 has 
access only to information x 2 , that is, u x =/ 1 x 1 and u 2 = f 2 x 2 . For collective stability, the 
example requires that /, + f 2 < 1 -a. For individual stability, it requires that /, < 1 - a and 
f 2 < 1 -a. Set Si,S 2 , and S c are illustrated in figure 1(a) and (b). When -1 <a < 1, S x X S 2 < ^S C 
and when 1 < a, 5, X S 2 C S c . For —1 <a< 1 , when both agents use their Fj C Sj purposely to 
regulate their state, it is possible that none of them will achieve that goal. For 1 <a, any individual 
action Fj C S' will guarantee the results x ( - -*• 0 as / ->■ °° for i - 1 and 2. For 1< a, it may also be 

interesting to note that it is possible to have F.- ^ Sj for both i = 1 and 2 while x { )2 -► 0 as t -»■ °° 

If Fj C Sj for all / implies (FjX . . . xF n ) C S c , the system is called coordinated. In a coordi- 
nated system, each agent need only to assess his stability regime Sj. If each of them pick an Fj C Sj, 
everyone will achieve Xj -*■ 0. 

A noncoordinated system can be made coordinated by imposing constraints on the permissible 
control policies each agent can use. Such constraints should be imposed by a certain central 
coordinator (planner). For instance, in the example given above, when — 1 <a < 1, if both agents 
are first forbidden to pick a feedback gain > (1 - a)/ 2, the system becomes coordinated. 
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The concept of coordination in a system is important. In a coordinated system, agents have' 
greater degree of autonomy. Both the decision of his control policy and implementation of such 
control can be done by total decentralization. In a noncoordinated system, the solution of all con- 
trol policies for each agent may require the help of a central coordinator, which may involve very 
difficult computation. Substituting u j — and yj = HjX into x = Ax = "ZBjUj, it becomes 

i +ZB i F i H i )x (4) 

Obviously, the condition for collective stability is that all eigenvalues of A + ZBjFjHj have negative 
real parts. 

A more basic question to ask is that, for given structure matrices (A;, Bj, Hj), is it ever possible 
to find matrices (F, , . . . , F n ) that stabilize the system? Fisher and Fuller (ref. 1), McFadden 
(refs. 2, 3), and Aoki (ref. 4) have all tried to answer this stabilizability problem in certain aspects. 
The general stabilizability problem for system equation (4) has been solved recently by Wang and 
Davison (ref. 5). The problem of finding conditions on ( Aj , Bj, Hj, Fj ) such that the system is 
coordinated, controlled by agents and can fully be decentralized is still open. 


OPTIMIZATION OF DECENTRALIZED CONTROL 


When the system can be stabilized collectively, we would like to consider how to optimize the 
choice of (F, , . . . , F n ). Costs are associated with the deviation of the states from their desirable 
values and the magnitudes of the control “forces.” The objective of the central coordinator 
(planner) is to choose (Fj , . . . , F n ) to minimize these costs. Suppose that the costs are described 
by the following quadratic loss function: 

J =2 Sq ^ xT Q x + 'Zu i T RjUj)dt (5) 


where <2 > 0 and Rj > 0 V i . Since 


x = Dx 


where 


D = A + IBjFjHj (6) 

we have 

* - e Dt x 0 (7) 

where x 0 is the state deviation at t — 0. Then 
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1 = 2 C X ° T ' ^ r '( 2 + J:H i TF ‘ TR ‘ F ‘ H y > ' x o d ‘ 

= J trace fr"” ^'(q + ZH l T F l T R i F j H^e D ' dt x 0 xj^ . (8) 

It can be shown that, if D is stable, J can be expressed as (ref. 6) 

J = 2 trace ( Kx o x o ( 9 ) 

where 

KD + D T K +Q+ 'LH i T F i T R i F i H i = 0 . (10) 

Since D is linear in |F ; J, K solved by equation (10) is a rational function of |F f ^. The determina- 
tion of optimal |F ; | falls in the framework of classical nonlinear programming. 

It can be shown (ref. 7) that the gradient at the optimal choice of |F^ must satisfy 

0 = RjFflfLHT + BjKLH ? (11) 


where 


LD T + DL + x o x o T =0. (12) 

The necessary condition of optimality (eqs. (10)— (12)) is given here as the generalization of the 
single-agent output optimization problems studied by Levine and Athans (ref. 8). Note that some- 
times the initial value x Q is not known exactly. The product x 0 x 0 ^ in equations (9) and (12) should 
be replaced by its expected value. Furthermore, if we wish to find the optimal | F^ independent of 
the initial states, we could use an approach by assuming that the initial state is random and distri- 
buted uniformly in a sphere, namely, the expected value of x Q x 0 ^ is expressed as an identity matrix. 

What is expressed in equations (10), (1 1), and (12) is the necessary condition of optimal |F^; 
it is entirely possible to have more than one solution to such conditions. In any case, the deter- 
mination of optimal -jF^ involves many structure parameters and we cannot possibly expect elegant 
solutions unless the system is of very low dimension or of high dimension with nice structures (e.g., 
in symmetry or repetition). 

Since in practice only a high-dimension system justifies decentralized control, we should 
examine in the sequel some examples of high-dimension decentralized problems where the system 
is nicely structured. With such a system, we will be able to derive some results and to understand 
the relations between structure and control system behavior in more explicit terms. 
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SPECIAL LARGE-SCALE SYSTEMS 


Sequential Systems (ref. 9) 


The system can be decomposed into a sequence of subsystems, all identical in structure (see 
fig. 2)L The interactions among subsystems depends only on the distance, that is, the difference in 
the subsystem indices. - . 

* Subsystem state and control u z - 


Y T Y T y T ^ 

■ ■ ’ *i—l > */ > x /+] > • • ' 

(13) 

pi-rt+fV; 

(14) 

yt = £ • 

(15) 


For simplicity, the number of subsystems in the string is assumed to be infinite so that there are no 
ends in the string and the roles of all subsystems are identical. If a system comprises a large but 
finite number of subsystems, it can be treated as if the number were infinite by assuming fictitious 
subsystems at both ends of the string. 


Since all subsystems are identical, the feedback gains used in each control can also be assumed 
identical, that is, 


u^ — Fy^ for all i (16) 

Bilateral transformations can be used to convert the sequential subsystems into a lumped system in 
the z domain: 

(17) 

Z_ Uf' (18) 


etc. 

Optimal conditions of F can be obtained by converting equations (9), ( 1 0), ( 1 1 ), and ( 1 2) into 
appropriate counterparts involving a z transformation (ref. 9). 
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Large-Scale Systems in Arrays 


This configuration is illustrated in figure 3. It is similar to the sequential system in the previous 
section, except that the subsystems are distributed in two-dimensional arrays. If the structures and 
interactions of all subsystems are all identical subject only to index translation, double bilateral 
^-transformation techniques can be used to solve the optimization problem. Such models could 
represent situations of street traffic control, huge electric network control, etc. 

Regulation of Vehicular Strings (ref. 10)— A string of high-speed, densely packed vehicles are 
moving along a certain guideway. It is desirable to keep the spacings and velocities of all vehicles 
in the string as close as possible to certain predetermined values (see fig. 4). The position deviation 
of the kth vehicle from its predetermined reference is denoted by x k . The dynamics of the kth 
vehicle can be described by the second-order, normalized differential equation: 

x k + axj. = Uf c ; - oo < £ < oo ( 19 ) 

The total information data for the kth vehicle is denoted by a vector y k given by 


H “ E H k-i 

The structure of equation (20) includes the following special cases: 

(i) yk T = ( x k-\> x k’ x k+\’ x k-l’ x k> x k+0 

(ii) y k T = (x k , x k ) 

(hi) yk T =0 c k-l> x k> x k+ 1> 

(iv) y k T = (x k ) 

(v) y k T = (xj, Xj I - °° < / < oo) . 

The linear feedback control used by each vehicle is of the same form: 

u k = F y k ; - °° < k <°° . (21) 

The regulation cost is assumed to be of the following form where the magnitude of the control 
forces and relative vehicle position errors are penalized. 


— oo<k< oo 


( 20 ) 




( 22 ) 
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Results- If a = 1 , q = 10, and the information is given as 


H = ( x kA> x k> x kH’h-\'h>h+\) T . (23) 

The optimal control for each vehicle is 

u k=fo x k+fi( x k-\ +x k+0 + S o x k +gi(.x k .i +**+!> 


where 


fo ~ “4.06 , A =1.43 , g 0 =1-94, g, = 0.52 . (24) 

The associated minimal cost is 


7* = 7.59. (25) 

Figure 5 represents the stability region as well as the sensitivity of J to various choices of feedback 
gains. Figure 6 shows the behavior of a string of 1 0 vehicles when the optimal control scheme is 
applied. The position of each vehicle is plotted relative to a coordinate system moving with the 
desired velocity. At t — 0, all vehicles are subject to random perturbations in position and velocity. 
Also, vehicle 5 is constantly driven with a sinusoidal disturbance force. For comparison, Figure 7 
shows the response of the 1 0 vehicles when the controllers chosen are too sensitive to the motion of 
the neighboring vehicles — the result is a chain collision. 


INFORMATION STRUCTURE DESIGN 


Structure Design versus Stabilizability 

The problem of stabilizability for a given structure ( A , Hj) was discussed under Stability 

and Coordination. A more basic problem is how the information structures influence the stabiliz- 
ability. In other words, if a system cannot be stabilized initially by the imposed decentralized 
scheme, what kind of change in information structure can induce stabilizability? 

Example— In the vehicular string regulation problem presented earlier, it can be shown that, if 
the information of vehicle k's state x k is available to him, the system can be made stable by appro- 
priate choice of the feedback gains. If the information available to the controllers is only velocity 
data with no position data, the entire system can never be stabilized by any choice of feedback 
gains. 

Example — Assignment Problem (ref. 3)^ In a special decentralized regulation problem, 

x = ZBjUj . (26) 

If control agent / knows only about the ith component of x, the feedback will be of the following 
form: 
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u i = F i x i 


(27) 


hence 


x = [B t ,B 2 , . . . ,B n ] 



(28) 


A nonstabilizable system could possibly be stabilized by permiting the information available to each 
agent. It has been shown that if the matrix [B x , B 2 , . . . , B n ] is nonsingular, it is always possible to 
stabilize the system of equation (28) by an appropriate permutation of the information structure, 
that is, 



= Px 


where P is a permutation matrix (refs. 3, 11). 


Structure versus Regulation Performance 

As expressed in equation (5), the regulation performance of the entire system is given by a 
certain cost function J. When the regulation is feasible (stabilizable), it is interesting to ask how 
each piece of information contributes to the optimization of J. A useful concept of the value of 
information can be defined. Simply speaking, the value of information can be visualized as the 
difference between the optimal J* with the given information and the optimal J* obtained without 
that particular information. 

Reconsider the vehicular string regulation problem. A list of certain information structures 
with the optimal J* of each is given below (see also fig. 8). 


Information available to the &th vehicle 

Optimal 
J * 

Case A: 

x k-\> x k’ x k+ 1’ x k- 1> x k’ x k + 1 

7.59 

Case B: 

x k-\ > x h x k + 1 

1 1.30 

Case C: 

x k’ x k 

8.13 

Case D: 

x k 

11.81 

Case E: 

(nothing) 

OO 

Case F : 

\ x j,Xj i-oo</ <oo| 

7.54 
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If each vehicle is provided with its own state information x j c , is 1181. The following are 
values of certain added pieces of information for the &th vehicle: 


(i) 

*k 

3.68 

J*{D)-J*(C ) 

(ii) 

x k- 1 and *fc+ 1 

.50 

J*{D)~J*(B ) 

(hi) 

x k- 1 » x k+l’ x k-l’ *&+l 

.54 

J*(C) - J*(A) 

(iv) 

x k - 1 and x k+\ 

.03 

7*(iii)-7*(ii) 

(v) 

| Xj, x j 1 for all 1/ —k i > 2^ 

.05 



For regulating this vehicular system, the controller’s own velocity data x^. is much more 
important than the data x^.j and x^+j . Moreover, the remote input data |x.-, x.- for I j — k I > 2} 
do not significantly add to the optimization of the regulation performance. If tne structure in 
case A is adopted, despite the fact that much less information is required than for case F, the per- 
formance index value is very close to the ultimate minimum. 

In real implementation of various control schemes, it is important to consider the cost of 
installing various measurement and control mechanisms and the feasibility of establishing various 
links for data and controls. The concept of information value discussed here provides a quantitative 
method for measuring and comparing the relative merits of different information structures and 
hence the usefulness of the different information provided. 
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CASE -1 < a < 1 


CASE 1 < a 



(a) Not coordinated. 


(b) Coordinated. 


Figure 1.— Example relations between individually and collectively stable systems. 
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DISPLACEMENTS 



Figure 6.— Behavior of an optimally regulated Figure 7.— Behavior of a vehicular string with 
string of 10 vehicles with the 5th vehicle excessive regulation, 

oscillating. 
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Figure 8. 
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Information value of various information structures in a regulated vehicular string. 
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